Recommendations for open data science
نویسندگان
چکیده
منابع مشابه
Recommendations for open data science
Life science research increasingly relies on large-scale computational analyses. However, the code and data used for these analyses are often lacking in publications. To maximize scientific impact, reproducibility, and reuse, it is crucial that these resources are made publicly available and are fully transparent. We provide recommendations for improving the openness of data-driven studies in l...
متن کاملOpen Data for Discovery Science
The modern healthcare and life sciences ecosystem is moving towards an increasingly open and data-centric approach to discovery science. This evolving paradigm is predicated on a complex set of information needs related to our collective ability to share, discover, reuse, integrate, and analyze open biological, clinical, and population level data resources of varying composition, granularity, a...
متن کاملOpen Data for Global Science
INTRODUCTION The global science system stands at a critical juncture. On the one hand, it is overwhelmed by a hidden avalanche of ephemeral bits that are central components of modern research and of the emerging ‘cyberinfrastructure’4 for e-Science.5 The rational management and exploitation of this cascade of digital assets offers boundless opportunities for research and applications. On the ot...
متن کاملAutOMAtING OPeN ScIeNce fOr BIG DAtA
the vast majority of social science research uses small (megabyteor gigabyte-scale) datasets. these fixedscale datasets are commonly downloaded to the researcher’s computer where the analysis is performed. the data can be shared, archived, and cited with wellestablished technologies, such as the Dataverse Project, to support the published results. the trend toward big data—including large-scale...
متن کاملRightinsight: open source architecture for data science
We give the details of our reference architecture called RightInsight for enabling rapid data science. RightInsight is based purely on open source technologies. The data is stored in a standard distributed file system such as HDFS. The stored data is processed in Apache Spark, which provides an enhanced Map/Reduce programming environment. Its rich and powerful machine learning base makes it eas...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: GigaScience
سال: 2016
ISSN: 2047-217X
DOI: 10.1186/s13742-016-0127-4